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SEQUENCE LISTING 



<110> Bisgard-Frantzen, Henrik 
Svendsen, Allan 
Borchert, Torben Vedel 

<12 0> Amylase Variants 



<130> 4318.224-US 

<140> 09/902,188 

<141> 2001-07-10 

<150> 08/600,656 

<151> 1996-02-13 



<150> PCT/DK96/00056 
<151> 1996-02-05 



<150> Denmark 

<151> 1995-02-03 

<150> Denmark 

<151> 1995-03-29 



<150> Denmark 
<151> 1995-09-29 

<150> Denmark 
<151> 1995-10-06 

<160> 32 

<170> FastSEQ for Windows Version 3.0 
<210> 1 
<400> 1 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 485 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide ' 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



His His Asn Gly Thr Asn Gly Thr Met Met Gin Tyr Phe Glu Trp Tyr 



1 



5 



10 



15 



Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Arg Asp Asp Ala Ala 
20 25 30 

Asn Leu Lys Ser Lys Gly lie Thr Ala Val Trp lie Pro Pro Ala Trp 
35 40 45 

Lys Gly Thr Ser Gin Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr 
50 55 60 

Asp Leu Gly Glu Phe Asn Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly 
65 70 75 80 

Thr Arg Asn Gin Leu Gin Ala Ala Val Thr Ser Leu Lys Asn Asn Gly 
85 90 95 

He Gin Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp 
100 105 110 

Gly Thr Glu He Val Asn Ala Val Glu Val Asn Arg Ser Asn Arg Asn 
115 120 125 

Gin Glu Thr Ser Gly Glu Tyr Ala He Glu Ala Trp Thr Lys Phe Asp 
130 135 140 

Phe Pro Gly Arg Gly Asn Asn His Ser Ser Phe Lys Trp Arg Trp Tyr 
145 150 155 160 

His Phe Asp Gly Thr Asp Trp Asp Gin Ser Arg Gin Leu Gin Asn Lys 
165 170 175 

He Tyr Lys Phe Arg Gly Thr Gly Lys Ala Trp Asp Trp Glu Val Asp 
180 185 190 

Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Met 
195 200 205 

Asp His Pro Glu Val lie His Glu Leu Arg Asn Trp Gly Val Trp Tyr 
210 215 220 

Thr Asn Thr Leu Asn Leu Asp Gly Phe Arg He Asp Ala Val Lys His 
225 230 235 240 

He -Lys Tyr Ser Phe Thr Arg Asp Trp Leu Thr His Val Arg Asn Thr 
245 250 255 

Thr Gly Lys Pro Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu 
260 265 270 

Gly Ala He Glu Asn Tyr Leu Asn Lys Thr Ser Trp Asn His Ser Val 
275 280 285 

Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Asn Ser Gly 
290 295 300 



Gly Tyr Tyr Asp Met Arg Asn lie Leu Asn Gly Ser Val Val Gin Lys 
305 310 315 320 



His Pro Thr His 



Gly Glu Ala Leu 
340 

Tyr Ala Leu Val 
355 

Gly Asp Tyr Tyr 
370 

Lys lie Asp Pro 
385 

Gin His Asp Tyr 



Gly Asn Ser Ser 
420 

Gly Pro Gly Gly 
435 

Gin Val Trp Arg 
450 

Asn Ala Asp Gly 
465 

Val Trp Val Lys 



Ala Val Thr Phe 
325 

Glu Ser Phe Val 



Leu Thr Arg Glu 
360 

Gly lie Pro Thr 
375 

Leu Leu Gin Ala 
390 

Phe Asp His His 
405 

His Pro Asn Ser 



Asn Lys Trp Met 
440 

Asp lie Thr Gly 
455 

Trp Gly Asn Phe 
470 

Gin 
485 



Val Asp Asn His 
330 

Gin Gin Trp Phe 
345 

Gin Gly Tyr Pro 



His Gly Val Pro 
380 

Arg Gin Thr Phe 
395 

Asp lie lie Gly 
410 

Gly Leu Ala Thr 
425 

Tyr Val Gly Lys 



Asn Arg Thr Gly 
460 

Ser Val Asn Gly 
475 



Asp Ser Gin Pro 
335 

Lys Pro Leu Ala 
350 

Ser Val Phe Tyr 
365 

Ala Met Lys Ser 



Ala Tyr Gly Thr 
400 

Trp Thr Arg Glu 
415 

lie Met Ser Asp 
430 

Asn Lys Ala Gly 
445 

Thr Val Thr He 



Gly Ser Val Ser 
480 



INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

His His Asn Gly Thr Asn Gly Thr Met Met Gin Tyr Phe Glu Trp His 
1 5 10 15 

Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Arg Asp Asp Ala Ser 
20 25 30 

Asn Leu Arg Asn Arg Gly He Thr Ala He Trp He Pro Pro Ala Trp 



35 



40 



45 



Lys Gly Thr Ser Gin Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr 
50 55 60 

Asp Leu Gly Glu Phe Asn Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly 
65 70 75 80 

Thr Arg Ser Gin Leu Glu Ser Ala lie His Ala Leu Lys Asn Asn Gly 
85 90 95 

Val Gin Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp 
100 105 110 

Ala Thr Glu Asn Val Leu Ala Val Glu Val Asn Pro Asn Asn Arg Asn 
115 120 125 

Gin Glu lie Ser Gly Asp Tyr Thr lie Glu Ala Trp Thr Lys Phe Asp 
130 135 140 

Phe Pro Gly Arg Gly Asn Thr Tyr Ser Asp Phe Lys Trp Arg Trp Tyr 
145 150 155 160 

His Phe Asp Gly Val Asp Trp Asp Gin Ser Arg Gin Phe Gin Asn Arg 
165 170 175 

lie Tyr Lys Phe Arg Gly Asp Gly Lys Ala Trp Asp Trp Glu Val Asp 
180 185 190 

Ser Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Met 
195 200 205 

Asp His Pro Glu Val Val Asn Glu Leu Arg Arg Trp Gly Glu Trp Tyr 
210 215 220 

Thr Asn Thr Leu Asn Leu Asp Gly Phe Arg lie Asp Ala Val Lys His 
225 230 235 240 

lie Lys Tyr Ser Phe Thr Arg Asp Trp Leu Thr His Val Arg Asn Ala 
245 250 255 

Thr Gly Lys Glu Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu 
260 265 270 

Gly Ala Leu Glu Asn Tyr Leu Asn Lys Thr Asn Trp Asn His Ser Val 
275 280 285 

Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Asn Ser Gly 
290 295 300 

Gly Asn Tyr Asp Met Ala Lys Leu Leu Asn Gly Thr Val Val Gin Lys 
305 310 315 320 

His Pro Met His Ala Val Thr Phe Val Asp Asn His Asp Ser Gin Pro 
325 330 335 



Gly Glu Ser Leu 
340 

Tyr Ala Leu lie 
355 

Gly Asp Tyr Tyr 
370 

Lys lie Asp Pro 
385 

Gin His Asp Tyr 



Gly Asn Thr Thr 
420 

Gly Pro Gly Gly 
435 

Gin Val Trp His 
450 

Asn Ala Asp Gly 
465 

He Trp Val Lys 



Glu Ser Phe Val 



Leu Thr Arg Glu 
360 

Gly He Pro Thr 
375 

He Leu Glu Ala 
390 

Phe Asp His His 
405 

His Pro Asn Ser 



Glu Lys Trp Met 
440 

Asp He Thr Gly 
455 

Trp Ala Asn Phe 
470 

Arg 
485 



Gin Glu Trp Phe 
345 

Gin Gly Tyr Pro 



His Ser Val Pro 
380 

Arg Gin Asn Phe 
395 

Asn lie lie Gly 
410 

Gly Leu Ala Thr 
425 

Tyr Val Gly Gin 



Asn Lys Pro Gly 
460 

Ser Val Asn Gly 
475 



Lys Pro Leu Ala 
350 

Ser Val Phe Tyr 
365 

Ala Met Lys Ala 



Ala Tyr Gly Thr 
400 

Trp Thr Arg Glu 
415 

lie Met Ser Asp 
430 

Asn Lys Ala Gly 
445 

Thr Val Thr lie 



Gly Ser Val Ser 
480 



INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 514 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Ala Ala Pro Phe Asn Gly Thr Met Met Gin Tyr Phe Glu Trp Tyr Leu 
15 10 15 

Pro Asp Asp Gly Thr Leu Trp Thr Lys Val Ala Asn Glu Ala Asn Asn 
20 25 30 

Leu Ser Ser Leu Gly lie Thr Ala Leu Trp Leu Pro Pro Ala Tyr Lys 
35 40 45 



Gly Thr Ser Arg Ser Asp Val Gly Tyr Gly Val Tyr Asp Leu Tyr Asp 
50 55 60 



Leu Gly Glu Phe Asn Gin Lys Gly Ala Val Arg Thr Lys Tyr Gly Thr 
65 70 75 80 



Lys Ala 



Gin Tyr Leu 
85 



Gin Ala He Gin 



Ala Ala His 
90 



Ala Ala Gly Met 
95 



Gin Val 



Tyr Ala Asp 
100 



Val Val 



Phe Asp 
105 



His Lys Gly 



Gly Ala Asp Gly 
110 



Thr Glu 



Trp Val Asp 
115 



Ala Val 



Glu Val 
120 



Asn Pro Ser 



Asp Arg Asn Gin 
125 



Glu He 
130 

Pro Gly 
145 



Ser Gly Thr 
Arg Gly Asn 



Tyr Gin 
135 

Thr Tyr 
150 



He Gin 



Ser Ser 



Ala Trp Thr 
140 

Phe Lys Trp 
155 



Lys Phe Asp Phe 



Arg Trp Tyr His 
160 



Phe Asp 
Lys Phe 
Asn Gly 



Gly Val Asp 
165 

Arg Gly He 
180 

Asn Tyr Asp 
195 



Trp Asp Glu Ser 



Gly Lys 



Tyr Leu 



Ala Trp 
185 

Met Tyr 
200 



Arg Lys Leu 
170 

Asp Trp Glu 
Ala Asp Leu 



Ser Arg lie Tyr 
175 

Val Asp Thr Glu 
190 

Asp Met Asp His 
205 



Pro Glu 
210 

Thr Thr 
225 



Val Val Thr 



Asn lie Asp 



Glu Leu 
215 

Gly Phe 
230 



Lys Ser 
Arg Leu 



Trp Gly Lys 
220 

Asp Ala Val 
235 



Trp Tyr Val Asn 



Lys His lie Lys 
240 



Phe Ser 



Phe Phe Pro 
245 



Asp Trp Leu Ser 



Asp Val Arg 
250 



Ser Gin Thr Gly 
255 



Lys Pro 



Leu His 



Leu Phe Thr 
260 

Asn Tyr lie 
275 



Val Gly 



Met Lys 



Glu Tyr 
265 

Thr Asn 
280 



Trp Ser Tyr 
Gly Thr Met 



Asp He Asn Lys 
270 

Ser Leu Phe Asp 
285 



Ala Pro 
290 



Leu His Asn 



Lys Phe 
295 



Tyr Thr 



Ala Ser Lys 
300 



Ser Gly Gly Thr 



Phe Asp 
305 



Met Arg Thr 



Leu Met 
310 



Thr Asn 



Thr Leu Met 
315 



Lys Asp Gin Pro 
320 



Thr ' Leu 



Ala Val Thr 
325 



Phe Val Asp Asn 



His Asp Thr 
330 



Glu Pro Gly Gin 
335 



Ala Leu Gin Ser Trp Val Asp Pro Trp Phe Lys Pro Leu Ala Tyr Ala 
340 345 350 



Phe lie Leu Thr 
355 

Tyr Tyr Gly lie 
370 

Asp Pro Leu Leu 
385 

Asp Tyr Leu Asp 



Thr Glu Lys Pro 
420 

Gly Gly Ser Lys 
435 

Phe Tyr Asp Leu 
450 

Asp Gly Trp Gly 
465 

Val Pro Arg Lys 



Arg Pro Trp Thr 
500 

Ala Trp 



Arg Gin Glu Gly 
360 

Pro Gin Tyr Asn 
375 

lie Ala Arg Arg 
390 

His Ser Asp lie 
405 

Gly Ser Gly Leu 



Trp Met Tyr Val 
440 

Thr Gly Asn Arg 
455 

Glu Phe Lys Val 
470 

Thr Thr Val Ser 
485 

Asp Glu Phe Val 



Tyr Pro Cys Val 



lie Pro Ser Leu 
380 

Asp Tyr Ala Tyr 
395 

lie Gly Trp Thr 
410 

Ala Ala Leu lie 
425 

Gly Lys Gin His 



Ser Asp Thr Val 
460 

Asn Gly Gly Ser 
475 

Thr lie Ala Trp 
490 

Arg Trp Thr Glu 
505 



Phe Tyr Gly Asp 
365 

Lys Ser Lys lie 



Gly Thr Gin His 
400 

Arg Glu Gly Val 
415 

Thr Asp Gly Pro 
430 

Ala Gly Lys Val 
445 

Thr lie Asn Ser 



Val Ser Val Trp 
480 

Ser lie Thr Thr 
495 

Pro Arg Leu Val 
510 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

CATCATAATG GAACAAATGG TACTATGATG CAATATTTCG AATGGTATTT GCCAAATGAC 60 

GGGAATCATT GGAACAGGTT GAGGGATGAC GCAGCTAACT TAAAGAGTAA AGGGATAACA 12 0 

GCTGTATGGA TCCCACCTGC ATGGAAGGGG ACTTCCCAGA ATGATGTAGG TTATGGAGCC 18 0 

TATGATTTAT ATGATCTTGG AGAGTTTAAC CAGAAGGGGA CGGTTCGTAC AAAATATGGA 24 0 

ACACGCAACC AGCTACAGGC TGCGGTGACC TCTTTAAAAA ATAACGGCAT TCAGGTATAT 300 

GGTGATGTCG TCATGAATCA TAAAGGTGGA GCAGATGGTA CGGAAATTGT AAATGCGGTA 360 



GAAGTGAATC GGAGCAACCG AAACCAGGAA ACCTCAGGAG AGTATGCAAT AGAAGCGTGG 420 

ACAAAGTTTG ATTTTCCTGG AAGAGGAAAT AACCATTCCA GCTTTAAGTG GCGCTGGTAT 480 

CATTTTGATG GGACAGATTG GGATCAGTCA CGCCAGCTTC AAAACAAAAT AT AT AAATT C 54 0 

AGGGGAACAG GCAAGGCCTG GGACTGGGAA GTCGATACAG AGAATGGCAA CTATGACTAT 600 

CTTATGTATG CAGACGTGGA TATGGATCAC CCAGAAGTAA TACATGAACT TAGAAACTGG 660 

GGAGTGTGGT ATACGAATAC ACTGAACCTT GATGGATTTA GAATAGATGC AGTGAAACAT 72 0 

ATAAAATATA GCTTTACGAG AGATTGGCTT ACACATGTGC GTAACACCAC AGGTAAACCA 780 

ATGTTTGCAG TGGCTGAGTT TTGGAAAAAT GACCTTGGTG CAATTGAAAA CTATTTGAAT 84 0 

AAAACAAGTT GGAATCACTC GGTGTTTGAT GTTCCTCTCC ACTATAATTT GTACAATGCA 900 

TCTAATAGCG GTGGTTATTA TGATATGAGA AATATTTTAA ATGGTTCTGT GGTGCAAAAA 960 

CATCCAACAC ATGCCGTTAC TTTTGTTGAT AACCATGATT CTCAGCCCGG GGAAGCATTG 1020 

GAATCCTTTG TTCAACAATG GTTTAAACCA CTTGCATATG CATTGGTTCT GACAAGGGAA 1080 

CAAGGTTATC CTTCCGTATT TTATGGGGAT TACTACGGTA TCCCAACCCA TGGTGTTCCG 114 0 

GCTATGAAAT CTAAAATAGA CCCTCTTCTG CAGGCACGTC AAACTTTTGC CTATGGTACG 1200 

CAGCATGATT ACTTTGATCA TCATGATATT ATCGGTTGGA CAAGAGAGGG AAATAGCTCC 1260 

CATCCAAATT CAGGCCTTGC CACCATTATG TCAGATGGTC CAGGTGGTAA CAAATGGATG 132 0 

TATGTGGGGA AAAATAAAGC GGGACAAGTT TGGAGAGATA TTACCGGAAA TAGGACAGGC 1380 

ACCGTCACAA TTAATGCAGA CGGATGGGGT AATTTCTCTG TTAATGGAGG GTCCGTTTCG 144 0 

GTTTGGGTGA AGCAA 14 55 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1455 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CATCATAATG GGACAAATGG GACGATGATG CAATACTTTG AATGGCACTT GCCTAATGAT 60 



GGGAATCACT GGAATAGATT AAGAGATGAT GCTAGTAATC TAAGAAATAG AGGTATAACC 



120 



GCTATTTGGA TTCCGCCTGC CTGGAAAGGG ACTTCGCAAA ATGATGTGGG GTATGGAGCC 180 

TATGATCTTT ATGATTTAGG GGAATTTAAT CAAAAGGGGA CGGTTCGTAC TAAGTATGGG 24 0 

ACACGTAGTC AATTGGAGTC TGC CATC CAT GCTTTAAAGA ATAATGGCGT TCAAGTTTAT 3 00 

GGGGATGTAG TGATGAACCA TAAAGGAGGA GCTGATGCTA CAGAAAACGT TCTTGCTGTC 360 

GAGGTGAATC CAAATAACCG GAATCAAGAA ATATCTGGGG ACTACACAAT TGAGGCTTGG 42 0 

ACTAAGTTTG ATTTTCCAGG GAGGGGTAAT ACATACTCAG ACTTTAAATG GCGTTGGTAT 48 0 

CATTTCGATG GTGTAGATTG GGATCAATCA CGACAATTCC AAAATCGTAT CTACAAATTC 54 0 

CGAGGTGATG GTAAGGCATG GGATTGGGAA GTAGATTCGG AAAATGGAAA TTATGATTAT 600 

TTAATGTATG CAGATGTAGA TATGGATCAT CCGGAGGTAG TAAATGAGCT TAGAAGATGG 660 

GGAGAATGGT ATACAAATAC ATTAAATCTT GATGGATTTA GGATCGATGC GGTGAAGCAT 720 

ATTAAATATA GCTTTACACG TGATTGGTTG ACCCATGTAA GAAACGCAAC GGGAAAAGAA 780 

ATGTTTGCTG TTGCTGAATT TTGGAAAAAT GATTTAGGTG CCTTGGAGAA CTATTTAAAT 84 0 

AAAACAAACT GGAATCATTC TGTCTTTGAT GTCCCCCTTC ATTATAATCT TTATAACGCG 900 

TCAAATAGTG GAGGCAACTA TGACATGGCA AAACTTCTTA ATGGAACGGT TGTTCAAAAG 960 

CATCCAATGC ATGCCGTAAC TTTTGTGGAT AATCACGATT CTCAACCTGG GGAATCATTA 102 0 

GAATCATTTG TACAAGAATG GTTTAAGCCA CTTGCTTATG CGCTTATTTT AACAAGAGAA 1080 

CAAGGCTATC CCTCTGTCTT CTATGGTGAC TACTATGGAA TTCCAACACA TAGTGTCCCA 114 0 

GCAATGAAAG CCAAGATTGA TCCAATCTTA GAGGCGCGTC AAAATTTTGC ATATGGAACA 1200 

CAACATGATT ATTTTGACCA TCATAATATA ATCGGATGGA CACGTGAAGG AAATACCACG 1260 

CATCCCAATT CAGGACTTGC GACTATCATG TCGGATGGGC CAGGGGGAGA GAAATGGATG 1320 

TACGTAGGGC AAAATAAAGC AGGTCAAGTT TGGCATGACA TAACTGGAAA TAAACCAGGA 13 80 

ACAGTTACGA TCAATGCAGA TGGATGGGCT AATTTTTCAG TAAATGGAGG ATCTGTTTCC 144 0 

ATTTGGGTGA AACGA 1455 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1548 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GCCGCACCGT TTAACGGCAC CATGATGCAG TATTTTGAAT GGTACTTGCC GGATGATGGC 60 

ACGTTATGGA CCAAAGTGGC CAATGAAGCC AACAACTTAT CCAGCCTTGG CATCACCGCT 12 0 

CTTTGGCTGC CGCCCGCTTA CAAAGGAACA AGCCGCAGCG ACGTAGGGTA CGGAGTATAC 18 0 

GACTTGTATG ACCTCGGCGA ATTCAATCAA AAAGGGACCG TCCGCACAAA ATACGGAACA 24 0 

AAAGCTCAAT ATCTTCAAGC CATTCAAGCC GCCCACGCCG CTGGAATGCA AGTGTACGCC 3 00 

GATGTCGTGT TCGACCATAA AGGCGGCGCT GACGGCACGG AATGGGTGGA CGCCGTCGAA 360 

GTCAATCCGT CCGACCGCAA CCAAGAAATC TCGGGCACCT ATCAAATCCA AGCATGGACG 420 

AAATTTGATT TTCCCGGGCG GGGCAACACC TACTCCAGCT TTAAGTGGCG CTGGTACCAT 4 80 

TTTGACGGCG TTGATTGGGA CGAAAGCCGA AAATTGAGCC GCATTTACAA MTCCGCGGC 540 

ATCGGCAAAG CGTGGGATTG GGAAGTAGAC ACGGAAAACG GAAACTATGA CTACTTAATG 600 

TATGCCGACC TTGATATGGA TCATCCCGAA GTCGTGACCG AGCTGAAAAA CTGGGGGAAA 660 

TGGTATGTCA ACACAACGAA CATTGATGGG TTCCGGCTTG ATGCCGTCAA GCATATTAAG 72 0 

TTCAGTTTTT TTCCTGATTG GTTGTCGTAT GTGCGTTCTC AGACTGGCAA GCCGCTATTT 7 80 

ACCGTCGGGG AATATTGGAG CTATGACATC AACAAGTTGC ACAATTACAT TACGAAAACA 84 0 

GACGGAACGA TGTCTTTGTT TGATGCCCCG TTACACAACA AATTTTATAC CGCTTCCAAA 900 

TCAGGGGGCG CATTTGATAT GCGCACGTTA ATGACCAATA CTCTCATGAA AGATCAACCG 960 

ACATTGGCCG TCACCTTCGT TGATAATCAT GACACCGAAC CCGGCCAAGC GCTGCAGTCA 1020 

TGGGTCGACC CATGGTTCAA ACCGTTGGCT TACGCCTTTA TTCTAACTCG GCAGGAAGGA 108 0 

TACCCGTGCG TCTTTTATGG TGACTATTAT GGCATTCCAC AATATAACAT TCCTTCGCTG 114 0 

AAAAGCAAAA TCGATCCGCT CCTCATCGCG CGCAGGGATT ATGCTTACGG AACGCAACAT 12 00 

GATTATCTTG ATCACTCCGA CATCATCGGG TGGACAAGGG AAGGGGGCAC TGAAAAACCA 1260 

GGATCCGGAC TGGCCGCACT GATCACCGAT GGGCCGGGAG GAAGCAAATG GATGTACGTT 1320 

GGCAAACAAC ACGCTGGAAA AGTGTTCTAT GACCTTACCG GCAACCGGAG TGACACCGTC 13 80 

ACCATCAACA GTGATGGATG GGGGGAATTC AAAGTCAATG GCGGTTCGGT TTCGGTTTGG 144 0 

GTTCCTAGAA AAACGACCGT TTCTACCATC GCTCGGCCGA TCACAACCCG ACCGTGGACT 1500 

GGTGAATTCG TCCGTTGGAC CGAACCACGG TTGGTGGCAT GGCCTTGA 154 8 



INFORMATION FOR SEQ ID NO: 7: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 485 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

His His Asn Gly Thr Asn Gly Thr Met Met Gin Tyr Phe Glu Trp Tyr 
1 5 10 15 

Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Asn Ser Asp Ala Ser 
20 25 30 

Asn Leu Lys Ser Lys Gly lie Thr Ala Val Trp lie Pro Pro Ala Trp 
35 40 45 

Lys Gly Ala Ser Gin Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr 
50 55 60 

Asp Leu Gly Glu Phe Asn Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly 
65 70 75 80 

Thr Arg Ser Gin Leu Gin Ala Ala Val Thr Ser Leu Lys Asn Asn Gly 
85 90 95 

He Gin Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp 
100 105 110 

Ala Thr Glu Met Val Arg Ala Val Glu Val Asn Pro Asn Asn Arg Asn 
115 120 125 

Gin Glu Val Thr Gly Glu Tyr Thr He Glu Ala Trp Thr Arg Phe Asp 
130 135 140 

Phe Pro Gly Arg Gly Asn Thr His Ser Ser Phe Lys Trp Arg Trp Tyr 
145 150 155 160 

His Phe Asp Gly Val Asp Trp Asp Gin Ser Arg Arg Leu Asn Asn Arg 

165 170 175 

He Tyr Lys Phe Arg Gly His Gly Lys Ala Trp Asp Trp Glu Val Asp 
180 185 190 

Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp He Asp Met 
195 200 205 

Asp His Pro Glu Val Val Asn Glu Leu Arg Asn Trp Gly Val Trp Tyr 
210 215 220 



Thr Asn Thr Leu Gly Leu Asp Gly Phe Arg He Asp Ala Val Lys His 
225 230 235 240 



lie Lys Tyr Ser Phe Thr Arg Asp Trp lie Asn His Val Arg Ser Ala 
245 250 255 



Thr Gly Lys Asn Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu 
260 265 270 

Gly Ala lie Glu Asn Tyr Leu Gin Lys Thr Asn Trp Asn His Ser Val 
275 280 285 

Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Lys Ser Gly 
290 295 300 

Gly Asn Tyr Asp Met Arg Asn lie Phe Asn Gly Thr Val Val Gin Arg 
305 310 315 320 

His Pro Ser His Ala Val Thr Phe Val Asp Asn His Asp Ser Gin Pro 
325 330 335 

Glu Glu Ala Leu Glu Ser Phe Val Glu Glu Trp Phe Lys Pro Leu Ala 
340 345 350 

Tyr Ala Leu Thr Leu Thr Arg Glu Gin Gly Tyr Pro Ser Val Phe Tyr 
355 360 365 

Gly Asp Tyr Tyr Gly lie Pro Thr His Gly Val Pro Ala Met Arg Ser 
370 375 380 

Lys lie Asp Pro lie Leu Glu Ala Arg Gin Lys Tyr Ala Tyr Gly Lys 
385 390 395 400 

Gin Asn Asp Tyr Leu Asp His His Asn lie lie Gly Trp Thr Arg Glu 
405 410 415 

Gly Asn Thr Ala His Pro Asn Ser Gly Leu Ala Thr lie Met Ser Asp 
420 425 430 

Gly Ala Gly Gly Ser Lys Trp Met Phe Val Gly Arg Asn Lys Ala Gly 
435 440 445 

Gin Val Trp Ser Asp lie Thr Gly Asn Arg Thr Gly Thr Val Thr lie 
450 455 460 



Asn Ala Asp Gly Trp Gly Asn Phe Ser Val Asn Gly Gly Ser Val Ser 
465 470 475 480 



lie Trp Val Asn Lys 
485 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
GCTGCGGTGA CCTCTTTAAA AAATAACGGC 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
CCACCGCTAT TAGATGCATT GTAC 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D) TOPOLOGY : 1 inear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10 
CTTACGTATG CAGACGTCGA TATGGATCAC CC 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
GATCCATATC GACGTCTGCA TACGTAAGAT AGTC 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 
TTASGGGCAA GGCCTGGGAC TGG 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
CCCAGGCCTT GCCCSTAAAT TTATATATTT TGTTTTG 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
GGTTTCGGTT CGAAGGATTC ACTTCTACCG C 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
GCGGTAGAAG TGAATCCTTC GAACCGAAAC CAG 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



GGTACTATCG TAACAATGGC CGATTGCTGA CGCTGTTATT TGC 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CTGTGACTGG TGAGTACTCA ACCAAGTC 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CTACTTCCCA ATCCCAAGCT TTACCTCGGA ATTTG 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CAAATTCCGA GGTAAAGCTT GGGATTGGGA AGTAG 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
TTGAACAACC GTTCCATTAA GAAG 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
CTCTGTATCG ACTTCCCAGT CCCAAGCTTT TGTCCTGAAT TTATATATTT TGTTTTGAAG 60 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
CTCTGTATCG ACTTCCCAGT CCCAAGCTTT GCCTCCGAAT TTATATATTT TGTTTTGAAG 60 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
ATGTGTAAGC CAATCGCGAG TAAAGCTAAA TTTTATATGT TTCACTGCAT C 51 



(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
GCACCAAGGT CATTTCGCCA GAATTCAGCC ACTG 34 



(2) INFORMATION FOR SEQ ID NO: 25: 



(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
TGTCAGAACC AACGCGTATG CACATGGTTT AAACCATTG 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
ACCACCTGGA CCATCGCTGC AGATGGTGGC AAGGCCTGAA TT 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TO POLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
GGCAAAAGTT TGACGTGCCT CGAGAAGAGG GTCTAT 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
TTGTCCCGCT TTATTCTGGC CAACATACAT CCATTT 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 
CCCAATCCCA AGCTTTACCA YCGAACTTGT AGATACG 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 
CCCAATCCCA AGCTTTATCT CSGAACTTGT AGATACG 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 
GATCCATATC GACGTCTGCA TACAGTAAAT AATC 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32 
GATCCATATC GACGTCTGCA TAAATTAAAT AATC 



SEQUENCE LISTING 



<110> Bisgard-Frantzen, Henrik 
Svendsen, Allan 
Borchert, Torben Vedel 

<120> Amylase Variants 



<130> 4318.224-US 

<140> 09/902,188 
<141> 2001-07-10 

<150> 08/600,656 
<151> 1996-02-13 

<150> PCT/DK96/00056 
<151> 1996-02-05 

<150> Denmark 
<151> 1995-02-03 

<150> Denmark 
<151> 1995-03-29 

<150> Denmark 
<151> 1995-09-29 

<150> Denmark 
<151> 1995-10-06 

<160> 32 

<170> FastSEQ for Windows Version 3.0 
<210> 1 
<400> 1 

INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

His His Asn Gly Thr Asn Gly Thr Met Met Gin Tyr Phe Glu Trp Tyr 
15 10 15 

Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Arg Asp Asp Ala Ala 
20 25 30 

Asn Leu Lys Ser Lys Gly lie Thr Ala Val Trp lie Pro Pro Ala Trp 
35 40 45 



Lys Gly Thr Ser Gin Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr 
50 55 60 



Asp Leu Gly Glu Phe Asn Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly 
65 70 75 80 

Thr Arg Asn Gin Leu Gin Ala Ala Val Thr Ser Leu Lys Asn Asn Gly 
85 90 95 

He Gin Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp 
100 105 110 

Gly Thr Glu He Val Asn Ala Val Glu Val Asn Arg Ser Asn Arg Asn 
115 120 125 

Gin Glu Thr Ser Gly Glu Tyr Ala He Glu Ala Trp Thr Lys Phe Asp 
130 135 140 

Phe Pro Gly Arg Gly Asn Asn His Ser Ser Phe Lys Trp Arg Trp Tyr 
145 150 155 160 

His Phe Asp Gly Thr Asp Trp Asp Gin Ser Arg Gin Leu Gin Asn Lys 
165 170 175 

He Tyr Lys Phe Arg Gly Thr Gly Lys Ala Trp Asp Trp Glu Val Asp 
180 185 190 

Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Met 
195 200 205 

Asp His Pro Glu Val He His Glu Leu Arg Asn Trp Gly Val Trp Tyr 
210 215 220 

Thr Asn Thr Leu Asn Leu Asp Gly Phe Arg He Asp Ala Val Lys His 
225 230 235 240 

He Lys Tyr Ser Phe Thr Arg Asp Trp Leu Thr His Val Arg Asn Thr 
245 250 255 

Thr Gly Lys Pro Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu 
260 265 270 

Gly Ala He Glu Asn Tyr Leu Asn Lys Thr Ser Trp Asn His Ser Val 
275 280 285 

Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Asn Ser Gly 
290 295 300 

Gly Tyr Tyr Asp Met Arg Asn He Leu Asn Gly Ser Val Val Gin Lys 
305 310 315 320 

His Pro Thr His Ala Val Thr Phe Val Asp Asn His Asp Ser Gin Pro 
325 330 335 

Gly Glu Ala Leu Glu. Ser Phe Val Gin Gin Trp Phe Lys Pro Leu Ala 
340 345 350 



Tyr Ala Leu Val Leu Thr Arg Glu Gin Gly Tyr Pro Ser Val Phe Tyr 
355 360 365 



Gly Asp Tyr Tyr Gly He Pro Thr His Gly Val Pro Ala Met Lys Ser 
370 375 380 



Lys lie Asp Pro 
385 

Gin His Asp Tyr 



Gly Asn Ser Ser 
420 

Gly Pro Gly Gly 
435 

Gin Val Trp Arg 
450 

Asn Ala Asp Gly 
465 

Val Trp Val Lys 



Leu Leu Gin Ala 
390 

Phe Asp His His 
405 

His Pro Asn Ser 



Asn Lys Trp Met 
440 

Asp lie Thr Gly 
455 

Trp Gly Asn Phe 
470 

Gin 
485 



Arg Gin Thr Phe 
395 

Asp lie lie Gly 
410 

Gly Leu Ala Thr 
425 

Tyr Val Gly Lys 



Asn Arg Thr Gly 
460 

Ser Val Asn Gly 
475 



Ala Tyr Gly Thr 
400 

Trp Thr Arg Glu 
415 

lie Met Ser Asp 
430 

Asn Lys Ala Gly 
445 

Thr Val Thr He 



Gly Ser Val Ser 
480 



INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

His His Asn Gly Thr Asn Gly Thr Met Met Gin Tyr Phe Glu Trp His 
15 10 15 

Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Arg Asp Asp Ala Ser 
20 25 30 

Asn Leu Arg Asn Arg Gly He Thr Ala He Trp He Pro Pro Ala Trp 
35 40 45 

Lys Gly Thr Ser Gin Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr 
50 55 60 

Asp Leu Gly Glu Phe Asn Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly 
65 70 75 80 

Thr Arg Ser Gin Leu Glu Ser Ala He His Ala Leu Lys Asn Asn Gly 
85 90 95 

Val Gin Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp 
100 105 110 

Ala Thr Glu Asn Val Leu Ala Val Glu Val Asn Pro Asn Asn Arg Asn 
115 120 125 

Gin Glu lie Ser Gly Asp Tyr Thr He Glu Ala Trp Thr Lys Phe Asp 
130 135 140 



Phe Pro Gly Arg Gly Asn Thr Tyr Ser Asp Phe Lys Trp Arg Trp Tyr 



145 



150 



155 



160 



His Phe Asp Gly Val Asp Trp Asp Gin Ser Arg Gin Phe Gin Asn Arg 
165 170 175 

lie Tyr Lys Phe Arg Gly Asp Gly Lys Ala Trp Asp Trp Glu Val Asp 
180 185 190 

Ser Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Met 
195 200 205 

Asp His Pro Glu Val Val Asn Glu Leu Arg Arg Trp Gly Glu Trp Tyr 
210 215 220 

Thr Asn Thr Leu Asn Leu Asp Gly Phe Arg lie Asp Ala Val Lys His 
225 230 235 240 

lie Lys Tyr Ser Phe Thr Arg Asp Trp Leu Thr His Val Arg Asn Ala 
245 250 255 

Thr Gly Lys Glu Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu 
260 265 270 

Gly Ala Leu Glu Asn Tyr Leu Asn Lys Thr Asn Trp Asn His Ser Val 
275 280 285 

Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Asn Ser Gly 
290 295 300 

Gly Asn Tyr Asp Met Ala Lys Leu Leu Asn Gly Thr Val Val Gin Lys 
305 310 315 320 

His Pro Met His Ala Val Thr Phe Val Asp Asn His Asp Ser Gin Pro 
325 330 335 

Gly Glu Ser Leu Glu Ser Phe Val Gin Glu Trp Phe Lys Pro Leu Ala 
340 345 350 

Tyr Ala Leu lie Leu Thr Arg Glu Gin Gly Tyr Pro Ser Val Phe Tyr 
355 360 365 

Gly Asp Tyr Tyr Gly lie Pro Thr His Ser Val Pro Ala Met Lys Ala 
370 375 380 

Lys lie Asp Pro lie Leu Glu Ala Arg Gin Asn Phe Ala Tyr Gly Thr 
385 390 395 400 

Gin His Asp Tyr Phe Asp His His Asn lie lie Gly Trp Thr Arg Glu 
405 410 415 

Gly Asn Thr Thr His Pro Asn Ser Gly Leu Ala Thr lie Met Ser Asp 
420 425 430 

Gly Pro Gly Gly Glu Lys Trp Met Tyr Val Gly Gin Asn Lys Ala Gly 
435 440 445 

Gin Val Trp His Asp lie Thr Gly Asn Lys Pro Gly Thr Val Thr lie 
450 455 460 



Asn Ala Asp Gly Trp Ala Asn Phe Ser Val Asn Gly Gly Ser Val Ser 
465 470 475 480 



lie Trp Val Lys Arg 



485 



INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 514 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Ala Ala Pro Phe Asn Gly Thr Met Met Gin Tyr Phe Glu Trp Tyr Leu 
15 10 15 

Pro Asp Asp Gly Thr Leu Trp Thr Lys Val Ala Asn Glu Ala Asn Asn 
20 25 30 

Leu Ser Ser Leu Gly lie Thr Ala Leu Trp Leu Pro Pro Ala Tyr Lys 
35 40 45 

Gly Thr Ser Arg Ser Asp Val Gly Tyr Gly Val Tyr Asp Leu Tyr Asp 
50 55 60 

Leu Gly Glu Phe Asn Gin Lys Gly Ala Val Arg Thr Lys Tyr Gly Thr 
65 70 75 80 

Lys Ala Gin Tyr Leu Gin Ala lie Gin Ala Ala His Ala Ala Gly Met 
85 90 95 

Gin Val Tyr Ala Asp Val Val Phe Asp His Lys Gly Gly Ala Asp Gly 
100 105 110 

Thr Glu Trp Val Asp Ala Val Glu Val Asn Pro Ser Asp Arg Asn Gin 
115 120 125 

Glu lie Ser Gly Thr Tyr Gin lie Gin Ala Trp Thr Lys Phe Asp Phe 
130 135 140 

Pro Gly Arg Gly Asn Thr Tyr Ser Ser Phe Lys Trp Arg Trp Tyr His 
145 150 155 160 

Phe Asp Gly Val Asp Trp Asp Glu Ser Arg Lys Leu Ser Arg lie Tyr 
165 170 175 

Lys Phe Arg Gly lie Gly Lys Ala Trp Asp Trp Glu Val Asp Thr Glu 
180 185 190 

Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Leu Asp Met Asp His 
195 200 205 

Pro Glu Val Val Thr Glu Leu Lys Ser Trp Gly Lys Trp Tyr Val Asn 
210 215 220 

Thr Thr Asn lie Asp Gly Phe Arg Leu Asp Ala Val Lys His lie Lys 
225 230 235 240 



Phe Ser Phe Phe Pro Asp Trp Leu Ser Asp Val Arg Ser Gin Thr Gly 
245 250 255 



Lys Pro Leu Phe Thr Val Gly Glu 
260 

Leu His Asn Tyr lie Met Lys Thr 
275 280 

Ala Pro Leu His Asn Lys Phe Tyr 
290 295 

Phe Asp Met Arg Thr Leu Met Thr 
305 310 



Tyr Trp Ser Tyr Asp lie Asn Lys 
265 270 

Asn Gly Thr Met Ser Leu Phe Asp 
285 

Thr Ala Ser Lys Ser Gly Gly Thr 
300 

Asn Thr Leu Met Lys Asp Gin Pro 
315 320 



Thr Leu Ala Val 



Ala Leu Gin Ser 
340 

Phe lie Leu Thr 
355 

Tyr Tyr Gly lie 
370 

Asp Pro Leu Leu 
385 

Asp Tyr Leu Asp 



Thr Glu Lys Pro 
420 

Gly Gly Ser Lys 
435 

Phe Tyr Asp Leu 
450 



Thr Phe Val Asp 
325 

Trp Val Asp Pro 



Arg Gin Glu Gly 
360 

Pro Gin Tyr Asn 
375 

lie Ala Arg Arg 
390 

His Ser Asp lie 
405 

Gly Ser Gly Leu 



Trp Met Tyr Val 
440 

Thr Gly Asn Arg 
455 



Asn His Asp Thr 
330 

Trp Phe Lys Pro 
345 

Tyr Pro Cys Val 



lie Pro Ser Leu 
380 

Asp Tyr Ala Tyr 
395 

lie Gly Trp Thr 
410 

Ala Ala Leu lie 
425 

Gly Lys Gin His 



Ser Asp Thr Val 
460 



Glu Pro Gly Gin 
335 

Leu Ala Tyr Ala 
350 

Phe Tyr Gly Asp 
365 

Lys Ser Lys lie 



Gly Thr Gin His 
400 

Arg Glu Gly Val 
415 

Thr Asp Gly Pro 
430 

Ala Gly Lys Val 
445 

Thr lie Asn Ser 



Asp Gly Trp Gly Glu Phe Lys Val 
465 470 

Val Pro Arg Lys Thr Thr Val Ser 
485 

Arg Pro Trp Thr Asp Glu Phe Val 
500 

Ala Trp 



Asn Gly Gly Ser Val Ser Val Trp 
475 480 

Thr lie Ala Trp Ser lie Thr Thr 
490 495 

Arg Trp Thr Glu Pro Arg Leu Val 
505 510 



INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1455 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



CATCATAATG GAACAAATGG TACTATGATG CAATATTTCG AATGGTATTT GCCAAATGAC 60 

GGGAATCATT GGAACAGGTT GAGGGATGAC GCAGCTAACT TAAAGAGTAA AGGGATAACA 120 

GCTGTATGGA TCCCACCTGC ATGGAAGGGG ACTTCCCAGA ATGATGTAGG TTATGGAGCC 180 

TATGATTTAT ATGATCTTGG AGAGTTTAAC CAGAAGGGGA CGGTTCGTAC AAAATATGGA 24 0 

ACACGCAACC AGCTACAGGC TGCGGTGACC TCTTTAAAAA ATAACGGCAT TCAGGTATAT 300 

GGTGATGTCG TCATGAATCA TAAAGGTGGA GCAGATGGTA CGGAAATTGT AAATGCGGTA 360 

GAAGTGAATC GGAGCAACCG AAACCAGGAA ACCTCAGGAG AGTATGCAAT AGAAGCGTGG 420 

ACAAAGTTTG ATTTTCCTGG AAGAGGAAAT AACCATTCCA GCTTTAAGTG GCGCTGGTAT 4 80 

CATTTTGATG GGACAGATTG GGATCAGTCA CGCCAGCTTC AAAACAAAAT ATATAAATTC 54 0 

AGGGGAACAG GCAAGGCCTG GGACTGGGAA GTCGATACAG AGAATGGCAA CTATGACTAT 600 

CTTATGTATG CAGACGTGGA TATGGATCAC CCAGAAGTAA TACATGAACT TAGAAACTGG 660 

GGAGTGTGGT ATACGAATAC ACTGAACCTT GATGGATTTA GAATAGATGC AGTGAAACAT 72 0 

ATAAAATATA GCTTTACGAG AGATTGGCTT ACACATGTGC GTAACACCAC AGGTAAACCA 78 0 

ATGTTTGCAG TGGCTGAGTT TTGGAAAAAT GACCTTGGTG CAATTGAAAA CTATTTGAAT 84 0 

AAAACAAGTT GGAATCACTC GGTGTTTGAT GTTCCTCTCC ACTATAATTT GTACAATGCA 900 

TCTAATAGCG GTGGTTATTA TGATATGAGA AATATTTTAA ATGGTTCTGT GGTGCAAAAA 960 

CATCCAACAC ATGCCGTTAC TTTTGTTGAT AACCATGATT CTCAGCCCGG GGAAGCATTG 102 0 

GAATCCTTTG TTCAACAATG GTTTAAACCA CTTGCATATG CATTGGTTCT GACAAGGGAA 1080 

CAAGGTTATC CTTCCGTATT TTATGGGGAT TACTACGGTA TCCCAACCCA TGGTGTTCCG 114 0 

GCTATGAAAT CTAAAATAGA CCCTCTTCTG CAGGCACGTC AAACTTTTGC CTATGGTACG 1200 

CAGCATGATT ACTTTGATCA TCATGATATT ATCGGTTGGA CAAGAGAGGG AAATAGCTCC 1260 

CATCCAAATT CAGGCCTTGC CACCATTATG TCAGATGGTC CAGGTGGTAA CAAATGGATG 132 0 

TATGTGGGGA AAAATAAAGC GGGACAAGTT TGGAGAGATA TTACCGGAAA TAGGACAGGC 1380 

ACCGTCACAA TTAATGCAGA CGGATGGGGT AATTTCTCTG TTAATGGAGG GTCCGTTTCG 1440 

GTTTGGGTGA AGCAA 1455 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



CATCATAATG GGACAAATGG GACGATGATG CAATACTTTG AATGGCACTT GCCTAATGAT 60 

GGGAATCACT GGAATAGATT AAGAGATGAT GCTAGTAATC TAAGAAATAG AGGTATAACC 120 

GCTATTTGGA TTCCGCCTGC CTGGAAAGGG ACTTCGCAAA ATGATGTGGG GTATGGAGCC 180 

TATGATCTTT ATGATTTAGG GGAATTTAAT CAAAAGGGGA CGGTTCGTAC TAAGTATGGG 240 

ACACGTAGTC AATTGGAGTC TGCCATCCAT GCTTTAAAGA ATAATGGCGT TCAAGTTTAT 300 

GGGGATGTAG TGATGAACCA TAAAGGAGGA GCTGATGCTA CAGAAAACGT TCTTGCTGTC 360 

GAGGTGAATC CAAATAACCG GAATCAAGAA ATATCTGGGG ACTACACAAT TGAGGCTTGG 420 

ACTAAGTTTG ATTTTCCAGG GAGGGGTAAT ACATACTCAG ACTTTAAATG GCGTTGGTAT 4 80 

CATTTCGATG GTGTAGATTG GGATCAATCA CGACAATTCC AAAATCGTAT CTACAAATTC 54 0 

CGAGGTGATG GTAAGGCATG GGATTGGGAA GTAGATTCGG AAAATGGAAA TTATGATTAT 600 

TTAATGTATG CAGATGTAGA TATGGATCAT CCGGAGGTAG TAAATGAGCT TAGAAGATGG 660 

GGAGAATGGT ATACAAATAC ATTAAATCTT GATGGATTTA GGATCGATGC GGTGAAGCAT 72 0 

ATTAAATATA GCTTTACACG TGATTGGTTG ACCCATGTAA GAAACGCAAC GGGAAAAGAA 780 

ATGTTTGCTG TTGCTGAATT TTGGAAAAAT GATTTAGGTG CCTTGGAGAA CTATTTAAAT 84 0 

AAAACAAACT GGAATCATTC TGTCTTTGAT GTCCCCCTTC ATTATAATCT TTATAACGCG 900 

TCAAATAGTG GAGGCAACTA TGACATGGCA AAACTTCTTA ATGGAACGGT TGTTCAAAAG 960 

CATCCAATGC ATGCCGTAAC TTTTGTGGAT AATCACGATT CTCAACCTGG GGAATCATTA 1020 

GAATCATTTG TACAAGAATG GTTTAAGCCA CTTGCTTATG CGCTTATTTT AACAAGAGAA 1080 

CAAGGCTATC CCTCTGTCTT CTATGGTGAC TACTATGGAA TTCCAACACA TAGTGTCCCA 114 0 

GCAATGAAAG CCAAGATTGA TCCAATCTTA GAGGCGCGTC AAAATTTTGC ATATGGAACA 1200 

CAACATGATT ATTTTGACCA TCATAATATA ATCGGATGGA CACGTGAAGG AAATACCACG 1260 

CATCCCAATT CAGGACTTGC GACTATCATG TCGGATGGGC CAGGGGGAGA GAAATGGATG 1320 

TACGTAGGGC AAAATAAAGC AGGTCAAGTT TGGCATGACA TAACTGGAAA TAAACCAGGA 1380 

ACAGTTACGA TCAATGCAGA TGGATGGGCT AATTTTTCAG TAAATGGAGG ATCTGTTTCC 144 0 

ATTTGGGTGA AACGA 1455 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1548 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



♦ 



GCCGCACCGT TTAACGGCAC CATGATGCAG TATTTTGAAT GGTACTTGCC GGATGATGGC 60 

ACGTTATGGA CCAAAGTGGC CAATGAAGCC AACAACTTAT CCAGCCTTGG CATCACCGCT 12 0 

CTTTGGCTGC CGCCCGCTTA CAAAGGAACA AGCCGCAGCG ACGTAGGGTA CGGAGTATAC 180 

GACTTGTATG ACCTCGGCGA ATTCAATCAA AAAGGGACCG TCCGCACAAA ATACGGAACA 240 

AAAGCTCAAT ATCTTCAAGC CATTCAAGCC GCCCACGCCG CTGGAATGCA AGTGTACGCC 300 

GATGTCGTGT TCGACCATAA AGGCGGCGCT GACGGCACGG AATGGGTGGA CGCCGTCGAA 360 

GTCAATCCGT CCGACCGCAA CCAAGAAATC TCGGGCACCT ATCAAATCCA AGCATGGACG 420 

AAATTTGATT TTCCCGGGCG GGGCAACACC TACTCCAGCT TTAAGTGGCG CTGGTACCAT 480 

TTTGACGGCG TTGATTGGGA CGAAAGCCGA AAATTGAGCC GCATTTACAA ATTCCGCGGC 54 0 

ATCGGCAAAG CGTGGGATTG GGAAGTAGAC ACGGAAAACG GAAACTATGA CTACTTAATG 600 

TATGCCGACC TTGATATGGA TCATCCCGAA GTCGTGACCG AGCTGAAAAA CTGGGGGAAA 660 

TGGTATGTCA ACACAACGAA CATTGATGGG TTCCGGCTTG ATGCCGTCAA GCATATTAAG 720 

TTCAGTTTTT TTCCTGATTG GTTGTCGTAT GTGCGTTCTC AGACTGGCAA GCCGCTATTT 780 

ACCGTCGGGG AATATTGGAG CTATGACATC AACAAGTTGC ACAATTACAT TACGAAAACA 84 0 

GACGGAACGA TGTCTTTGTT TGATGCCCCG TTACACAACA AATTTTATAC CGCTTCCAAA 900 

TCAGGGGGCG CATTTGATAT GCGCACGTTA ATGACCAATA CTCTCATGAA AGATCAACCG 960 

ACATTGGCCG TCACCTTCGT TGATAATCAT GACACCGAAC CCGGCCAAGC GCTGCAGTCA 1020 

TGGGTCGACC CATGGTTCAA ACCGTTGGCT TACGCCTTTA TTCTAACTCG GCAGGAAGGA 1080 

TACCCGTGCG TCTTTTATGG TGACTATTAT GGCATTCCAC AATATAACAT TCCTTCGCTG 114 0 

AAAAGCAAAA TCGATCCGCT CCTCATCGCG CGCAGGGATT ATGCTTACGG AACGCAACAT 1200 

GATTATCTTG ATCACTCCGA CATCATCGGG TGGACAAGGG AAGGGGGCAC TGAAAAACCA 1260 

GGATCCGGAC TGGCCGCACT GATCACCGAT GGGCCGGGAG GAAGCAAATG GATGTACGTT 1320 

GGCAAACAAC ACGCTGGAAA AGTGTTCTAT GACCTTACCG GCAACCGGAG TGACACCGTC 1380 

ACCATCAACA GTGATGGATG GGGGGAATTC AAAGTCAATG GCGGTTCGGT TTCGGTTTGG 144 0 

GTTCCTAGAA AAACGACCGT TTCTACCATC GCTCGGCCGA TCACAACCCG ACCGTGGACT 1500 

GGTGAATTCG TCCGTTGGAC CGAACCACGG TTGGTGGCAT GGCCTTGA 154 8 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 485 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



+ 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

His His Asn Gly Thr Asn Gly Thr Met Met Gin Tyr Phe Glu Trp Tyr 
15 10 15 

Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Asn Ser Asp Ala Ser 
20 25 30 

Asn Leu Lys Ser Lys Gly lie Thr Ala Val Trp lie Pro Pro Ala Trp 
35 40 45 

Lys Gly Ala Ser Gin Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr 
50 55 60 

Asp Leu Gly Glu Phe Asn Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly 
65 70 75 80 

Thr Arg Ser Gin Leu Gin Ala Ala Val Thr Ser Leu Lys Asn Asn Gly 
85 90 95 

He Gin Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp 
100 105 110 

Ala Thr Glu Met Val Arg Ala Val Glu Val Asn Pro Asn Asn Arg Asn 
115 120 125 

Gin Glu Val Thr Gly Glu Tyr Thr He Glu Ala Trp Thr Arg Phe Asp 
130 135 140 

Phe Pro Gly Arg Gly Asn Thr His Ser Ser Phe Lys Trp Arg Trp Tyr 
145 150 155 160 

His Phe Asp Gly Val Asp Trp Asp Gin Ser Arg Arg Leu Asn Asn Arg 

165 170 175 

He Tyr Lys Phe Arg Gly His Gly Lys Ala Trp Asp Trp Glu Val Asp 
180 185 190 

Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp He Asp Met 
195 200 205 

Asp His Pro Glu Val Val Asn Glu Leu Arg Asn Trp Gly Val Trp Tyr 
210 215 220 

Thr Asn Thr Leu Gly Leu Asp Gly Phe Arg He Asp Ala Val Lys His 
225 230 235 240 

lie Lys Tyr Ser Phe Thr Arg Asp Trp lie Asn His Val Arg Ser Ala 
245 250 255 

Thr Gly Lys Asn Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu 
260 265 270 

Gly Ala lie Glu Asn Tyr Leu Gin Lys Thr Asn Trp Asn His Ser Val 
275 280 285 

Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Lys Ser Gly 
290 295 300 

Gly Asn Tyr Asp Met Arg Asn lie Phe Asn Gly Thr Val Val Gin Arg 
305 310 315 320 



His Pro Ser His Ala Val Thr Phe Val Asp Asn His Asp Ser Gin Pro 
325 330 335 



Glu Glu Ala Leu 
340 

Tyr Ala Leu Thr 
355 

Gly Asp Tyr Tyr 
370 

Lys lie Asp Pro 
385 

Gin Asn Asp Tyr 



Gly Asn Thr Ala 
420 

Gly Ala Gly Gly 
435 

Gin Val Trp Ser 
450 

Asn Ala Asp Gly 
465 

lie Trp Val Asn 



Glu Ser Phe Val 



Leu Thr Arg Glu 
360 

Gly lie Pro Thr 
375 

lie Leu Glu Ala 
390 

Leu Asp His His 
405 

His Pro Asn Ser 



Ser Lys Trp Met 
440 

Asp lie Thr Gly 
455 

Trp Gly Asn Phe 
470 

Lys 
485 



Glu Glu Trp Phe 
345 

Gin Gly Tyr Pro 



His Gly Val Pro 
380 

Arg Gin Lys Tyr 
395 

Asn lie lie Gly 
410 

Gly Leu Ala Thr 
425 

Phe Val Gly Arg 



Asn Arg Thr Gly 
460 

Ser Val Asn Gly 
475 



Lys Pro Leu Ala 
350 

Ser Val Phe Tyr 
365 

Ala Met Arg Ser 



Ala Tyr Gly Lys 
400 

Trp Thr Arg Glu 
415 

lie Met Ser Asp 
430 

Asn Lys Ala Gly 
445 

Thr Val Thr He 



Gly Ser Val Ser 
480 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
GCTGCGGTGA CCTCTTTAAA AAATAACGGC 30 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
CCACCGCTAT TAGATGCATT GTAC 24 



(2) INFORMATION FOR SEQ ID NO: 10: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 
CTTACGTATG CAGACGTCGA TATGGATCAC CC 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
GATCCATATC GACGTCTGCA TACGTAAGAT AGTC 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 
TTASGGGCAA GGCCTGGGAC TGG 



(2) INFORMATION FOR SEQ ID NO:13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
CCCAGGCCTT GCCCSTAAAT TTATATATTT TGTTTTG 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
GGTTTCGGTT CGAAGGATTC ACTTCTACCG C 



(2) INFORMATION FOR SEQ ID NO: 15: 



(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GCGGTAGAAG TGAATCCTTC GAACCGAAAC CAG 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GGTACTATCG TAACAATGGC CGATTGCTGA CGCTGTTATT TGC 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CTGTGACTGG TGAGTACTCA ACCAAGTC 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CTACTTCCCA ATCCCAAGCT TTACCTCGGA ATTTG 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CAAATTCCGA GGTAAAGCTT GGGATTGGGA AGTAG 



(2) INFORMATION FOR SEQ ID NO: 20: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
TTGAACAACC GTTCCATTAA GAAG 24 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
CTCTGTATCG ACTTCCCAGT CCCAAGCTTT TGTCCTGAAT TTATATATTT TGTTTTGAAG 60 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
CTCTGTATCG ACTTCCCAGT CCCAAGCTTT GCCTCCGAAT TTATATATTT TGTTTTGAAG 60 



(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
ATGTGTAAGC CAATCGCGAG TAAAGCTAAA TTTTATATGT TTCACTGCAT C 51 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 



GCACCAAGGT CATTTCGCCA GAATTCAGCC ACTG 



34 



(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:25: 
TGTCAGAACC AACGCGTATG CACATGGTTT AAACCATTG 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
ACCACCTGGA CCATCGCTGC AGATGGTGGC AAGGCCTGAA TT 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
GGCAAAAGTT TGACGTGCCT CGAGAAGAGG GTCTAT 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
TTGTCCCGCT TTATTCTGGC CAACATACAT CCATTT 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 



CCCAATCCCA AGCTTTACCA YCGAACTTGT AGATACG 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30 
CCCAATCCCA AGCTTTATCT CSGAACTTGT AGATACG 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 
GATCCATATC GACGTCTGCA TACAGTAAAT AATC 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 



GATCCATATC GACGTCTGCA TAAATTAAAT AATC 



